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Motivated by studies of typical properties of quantum states in statistical mechanics, we introduce 
phase-random states, an ensemble of pure states with fixed amplitudes and uniformly distributed 
phases in a fixed basis. We first give a sufficient condition for canonical states to typically appear in 
subsystems of phase-random states, which reveals a trade-off relation between the initial state in the 
bounded energy subspace and the energy eigenstates that define that subspace. We then investigate 
the simulat ability of phase-random states, which is directly related to that of time evolution in closed 
systems, by studying their entanglement properties. We find that starting from a separable state, 
time evolutions under Hamiltonians composed of only separable eigenstates generate extremely high 
entanglement and are difficult to simulate with matrix product states. We also show that random 
quantum circuits consisting of only two-qubit diagonal unitaries can generate an ensemble with the 
same average entanglement as phase-random states. 
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I. INTRODUCTION 

One of the goals of quantum many-body physics is to 
be able to compute properties such as expectation val- 
ues of observables, entanglement and explicit state de- 
scriptions for physical systems composed of many par- 
ticles. These properties depend upon the parameters of 
the quantum states involved, and calculations are made 
difficult by the fact that the number of these parame- 
ters grows exponentially with the number of particles. 
One way around this problem is to consider ensembles of 
states as opposed to individual states, as many of those 
parameters are then averaged out. 

The most natural and well-studied ensemble of states 
is that of random states^ the set of pure states in Hilbert 
space selected randomly from the unitarily invariant dis- 
tribution, used in many areas of quantum physics and 
quantum information science [l|-[3|. The entanglement 
of random states is an area of particular recent interest, 
where it has been shown that in large systems, the aver- 
age amount of entanglement is nearly maximal according 
to several measures [5| . In the context of quantum statis- 
tical mechanics, the high entanglement of random states 
has been shown to lead to reduced states that are ther- 
mal, under their restriction to a subspace constrained by 
the amount of total energy [6[. 

Symmetry is often used as a constraint, however there 
are other ways to place useful restrictions on an ensemble. 
If states are restricted to a certain subspace of Hilbert 
space, an ensemble of states could be described by ran- 
dom states in that subspace. In this paper, we address 
another type of restriction, which leads to ensembles of 
states in a subset of a Hilbert space. A simple example of 
a subset of states that is not a subspace is the set of prod- 
uct states, as they are not closed under superpositions. 
A more interesting example is a state evolving under a 
time-independent Hamiltonian. Since the time evolution 



changes only the phases of the expansion coefficients in 
the Hamiltonian's eigenbasis, states reached during the 
time evolution form a subset rather than a subspace. Our 
aim is to study random states in such a subset, namely, 
an ensemble of states where the randomness is restricted 
to the phase of the complex expansion coefficients in a 
given basis, which we call phase-random states. 

Phase-random states are closely connected to studies 
of typical properties in statistical mechanics [l|, . States 
in which the phases are the pertinent degree of freedom 
appear in quantum information theory such as in in- 
stantaneously quantum polynomial time (IQP) circuits 
[8], and locally maximally entanglable (LME) states [9[. 
Phase-random states also describe situations where the 
accessible information is limited to the amplitudes in a 
certain basis, for instance, those where a unique rank-1 
measurement is allowed. 

Motivated by these considerations, we investigate sta- 
tistical properties of phase-random states, which clearly 
depend on the amplitudes of the coefficients as well as 
the expansion basis. We first demonstrate the thermal- 
ization of their reduced states, which implies potential 
uses of phase-random ensembles to realize thermal states 
in subsystems. Then, regarding phase-random states as 
typical states during a Hamiltonian dynamics, we dis- 
cuss simulating them with matrix product states (MPSs) 
[lo[ by deriving the average amount of entanglement of 
phase-random states. Moreover, applications of phase- 
random states in IQP circuits and as LME states lead us 
to develop a scheme for generating an ensemble of states 
simulating the entanglement of phase-random states by 
a quantum circuit composed of only diagonal two qubit 
unitaries, which we call a phase-random circuit. 

The paper is organized as follows. In Sec. |TT1 we define 
phase-random states and show how they can be used to 
study thermalization in statistical mechanics. In Sec. lIIH 
we derive explicit formula for the average amount of en- 
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tanglement of phase-random states. Using the formula, 
we investigate the simulatabihty of Hamiltonian dynam- 
ics by MPSs in Sec. lIVI Finahy, we introduce and analyze 
phase-random circuits in Sec. |Vl 

II. PHASE-RANDOM STATES AND 
THERMALIZATION 

Given a Hilbert space we denote an ensemble of 
pure states G H distributed according to some mea- 
sure d/i by T = {|V^)}d/i- The ensemble of random states 
is written Trand = {|'^)}d^5 where |?/^) is an arbitrary 
state and dtjj is the unitarily invariant normalized Haar 
measure. For a Hilbert space of qubits, consider states 
of the form 

n=l 

with both the amplitudes {rn \ XI ^ — 0<rn<l} 
and orthonormal basis {|^in)} fixed. By phase-random 
states, we mean the ensemble Tphase = {|0)}d(^7 where 
the phases cpn are distributed according to the normalized 
Lebesgue measure given by 

difi d(p2N 



on [0,27r]^ . This ensemble clearly depends on 
the choice of amplitudes and basis, which we write 
'^phase({^n7 |'^n)}n) whcu there is need to be explicit. 
Note that the ensemble of phase-random ensembles with 
appropriately distributed amplitudes is the ensemble of 
random states. 

We first point out that studies of thermalization in 
closed systems are special instances of the study 
of phase-random states. To see this, consider a Hilbert 
space H with dimension and a Hamiltonian H = 
Xln=i |en)(en|- The state at time t is given by 

n 

where rne*"^"" = (enl^o) with rn > and |0o) is an initial 
state. Then, a time averaged thermodynamical quan- 
tity is often considered by assuming phase ergodicity in 
the sense that the distribution of phases ^-'f^^r.t/h+i^fri ^^^^ 
uniform in [0, 27r] in the long-time limit. Due to this 
identification, all studies addressing the time average are 
equivalent to investigations of statistical properties of the 
corresponding phase-random states Tphase({^n7 \^n)}n)' 
For example, in [1] it was proven that time evolution 
typically gives rise to canonical distributions in subsys- 
tems. In this case, we consider a Hilbert space 
where 1-Ls iJ~LE) represents a system (environment) with 
dimension ds {dE)^ and Hr is a restricted subspace con- 
strained by the energy defined by 

Hr = span{|eQ,) \e — Se < Ca < e ^ Se} C Hs He- 



Then, if an initial state lies in Hr^ the reduced den- 
sity matrix on the system Tr^; \(j){t)) {(f){t)\ should be in a 
neighborhood of ps given by 

dR 

PS = ^rlTiE \ea){ea\ , 

for most of the time t, where = Re(eQ,|(/)o), ^ \ ea) ^ 
and dR = dimHR. We say that for most of the phase- 
random states Tphase ({^n, |en)}n), whcrc fn is deter- 
mined by 1^(0)) and Hr, a reduced density matrix on 
the system is close to ps. 

By evaluating the trace distance between the state ps 
and a canonical state TiE^R/dR where Ir is the identity 
matrix on Hr^ one can obtain the following condition for 
ps to be a canonical state, 

E('^i-^)(^^-^)Tr[e§4]=0, (1) 

a, (3=1 

where e| = Tr^; |e/e)(e/c|. This provides a trade-off rela- 
tion between the initial state and the restricted Hilbert 
space Hr for thermalization. In order to see this, con- 
sider the following two extreme cases for Eq. ([1]) to hold, 
which trivially lead to thermalization in subsystems. 

Firstly, if the conditions are imposed only on the am- 
plitudes, the amplitudes must be equal, i.e. = ^/dR 
for all a. Since the amplitudes are defined by the initial 
state, this is a condition for the initial state to exhibit 
thermalization. On the other hand, if the conditions are 
imposed only on the restricted Hilbert space Hr, the 
eigenstates in Hr should satisfy Tr[ege^] = Tr[eg'e^], 
V \ea) , |e/3) , |ea') , \e/3^) ^ ^r- By choosing a = p = p' , 
we obtain Tr[(eg)^] = Tr[egeg ]. Since the left-hand side 
is a norm of eg and the right-hand side is the Hilbert- 
Schmidt inner product of eg and eg', it implies that all 
reduced density matrices should be identical. Thus, this 
condition on the eigenstates in Hr trivially results in 
thermalization. 

Equation ([T]) gives conditions applicable in interme- 
diate situations between these two extreme cases. It 
therefore provides grounds for the study of the way ther- 
malization depends on a system's initial state and on its 
Hamiltonian [4]. 

III. ENTANGLEMENT 

We investigate the entanglement properties of phase- 
random states using entropic measures of entanglement 
since they can reveal if the state is simulatable by MPSs. 
Consider divisions into two subsystems A and A, com- 
posed of Na and = N — Na qubits respectively. 
We denote the density matrix of by (p = | (/>)((/> | 
and its reduced density matrix on the subsystem A by 
(j)A = Tr^(/) = Tr^ For a given pure state 

and subsystem A, the amount of entanglement in terms 
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FIG. 1: (Color online) The distributions for AT = 8 of the 
amount of entanglement for two ensembles, (a) random states 
and (b) phase-random states with equal amplitudes and a 
separable basis, using the Meyer- Wallach measure of entangle- 
ment £^Mw(|0)) := j^^k=i ^^L^^\\'i^)) ^ where k labels single- 
qubit subsystems pj|. The number of samples is 10^, binned 
in intervals of 0.002. 



Thus we have 
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• (3) 



Denoting the mutual information of <i> between A and A 
in terms of the linear entropy by /{^^(l>) = 6'L(Tr^l>) + 
Sl{^ta^) - Sl{^), Eq. dS]) reduces to 



of the linear entropy is given by E^^\\(j))) = Sl{4^a) 
where Sl{p) = 1 — Trp^, and in terms of the von Neu- 
mann entropy is given by = S{4>a) where 
S{p) = -Trplogp. We have < < 1 - 2'^^. 
The linear entropy gives a lower bound on the von Neu- 
mann entropy, - log[l - E^j^\\(j)))] < E^^\\(l))). Hence, 
we will use the linear entropy to measure entanglement 
in this paper unless otherwise specified. 

For the ensemble of random states T^and on N qubits, 
the average amount of entanglement is 
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indicating that random states of large systems are nearly 
maximally entangled on average For an ensemble of 
phase random states Tphase and a choice of subsystem A, 
the average amount of entanglement is defined by 

which, recall, is a function of {r^, \un)}' Note that in 

general (^^i^^^) Tphase ^^l^\^)-> where <l is the density 
matrix defined by the phase-random ensemble, namely, 

4>= / dLp^ = ^rl\Un){Un\. 

In order to calculate the average amount of entangle- 
ment, we expand the basis elements such that \un) = 

Ea=i \^)a ^ l^n where ... 2a^a is a compu- 

tational basis for subsystem A, d is binary for a — 1, and 
tildes indicate unnormalized l^ets. Defining (j)^^} by 



E^ 



Tr(^y^)^ is given by l^ia'^P- This is quadratic, so 

will involve a sum over four basis labels n,m,/,A:, and 
where the only phase integral that occurs is 



/ 



Equation (|4]) simplifies the investigation of the depen- 

basis. 

First, 



dence of (^^x.^^^) Tphase ^n the amplitudes and the basis. 



We consider two cases in particular, 
we analyze equal-amplitudes ensembles Tp^^^^ = 

Tphase({2~^/^, \un)})' In this case, the average amount 
of entanglement over phases is given by 



/^(^)\ e 



n=l 



22N 



(5) 



This shows that {E^^)^e<i^ is a decreasing function of 

the basis entanglement, E^\\un)). Hence ensembles 
that also have a separable basis denoted by 

T^'phSe = Tphase({2-^/2, |?|-P)}), give the maximum, 



2Na ^ - 1 



This value is greater than that of random states 
given by Eq. (j2j), see also Fig. [H For |(^eq,sep^ _ 
2-A^/2 l^sep^^ applying the concentration of 



measure to A£;[^^ (|(/)^^'^^p)) 
(e[^^)x^^,^^y>\, we find 

^ ^ ' -tphase " 



Prob 



A£;[^^(|0^^'^^P)) > 2/2'^ +e 



< e 



(^|0eq,sep^) 



(6) 



where c = l/(2^^7r^). The proof is similar to that in 
Ref. [U] (see Appendix [Al for details). Thus the entan- 
glement of phase-random states Tp^'^^^ is highly concen- 
trated around the average, demonstrated in Fig. [H 

States with equal amplitudes in a separable basis are 
also known as LME states These are the class of 
multipartite states that are maximally entanglable with 
local auxiliary systems by only local operations. In Ref. 
0, it is mentioned that the LME states should exhibit 
high entanglement. Our result proves this statement is 
true in the sense that the uniform ensemble of LME states 
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achieves a higher average amount of entanglement, in 
terms of hnear entropy, than that of random states. 

On the other hand, for separable-basis ensembles de- 
fined by Tp^tf^gg = Tphase({^n, K''^)}), an upper bound 
of the average amount of entanglement is given by 

where 4>®'^p = XI n \^n^){^n^\' When the number of 
non-zero is is bounded by 1 — from 

above. If R is small, (for instance, if R = poly(A^)), the 
average amount of entanglement cannot be as large as 
that of random states. It is therefore necessary for the 
basis to be entangled in order to generate a large amount 
of entanglement on average when R is small. 

IV. SIMULATABILITY OF HAMILTONIAN 
DYNAMICS 

We now interpret our results in the context of time- 
independent Hamiltonian dynamics and consider the sim- 
ulatability of the state during the time evolution by as- 
suming phase ergodicity. We consider the area law of 
entanglement, which states that the von Neumann en- 
tropy of entanglement of a large subsystem is at most 
proportional to its boundary. Since the breakdown of 
the area law indicates that the states cannot be simu- 
lated by MPSs with a constant matrix size the area 
law gives insight into the simulat ability of the state. The 
area law is often studied for ground states of spin sys- 
tems. It is also known that, initial states that do not 
violate the area law will not do so over a certain time 
scale evolving under a local Hamiltonian [l2|. 

Applying our results to a lattice of qubits, we consider 
the long-time average of the von Neumann entropy of 
entanglement generated by a time-independent Hamil- 
tonian dynamics (£^*^^^)t;oo- Using the facts that the 
von Neumann entropy is lower bounded by the linear 
entropy and the concavity of the logarithm, we obtain 
a lower bound on (£^^'^^)Tphase5 which can be identified 
with (£^*^^^)t;oo under phase ergodicity. Thus, we have 

{E^^^)t-oo > -log[l-(4^^)Tp,_]. By applying Eq. Q, 
we can check the area law in the long-time average. 

In particular, we consider Hamiltonians composed of 
separable eigenstates, which are often referred to as semi- 
classical. When the initial state is a superposition of sep- 
arable eigenstates with equal amplitudes, the initial state 
is also separable, and the corresponding phase-random 
states are Tp^'^^^ which obtains the maximum of Eq. (|5]). 
Thus we obtain 

{E^^^)t;oo >Na- log(l + 2^^--^ - 2^^-^), 

which grows in proportion to the volume Na of the sub- 
system A when Na <C and not with any boundary 
size, and the area law is broken. Since entanglement con- 
centrates around its average during the time evolution as 
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FIG. 2: Phase-random circuit composed of two-qubit un- 
tiaries Wt{it,jt) acting on randomly selected pairs of qubits 
{itjt)- 

in Eq. (|6|), the states are not simulatable by MPSs with 
a constant matrix size for most times. This is surprising 
at first because all eigenstates as well as the initial state 
are separable, however the dynamics generate extremely 
high entanglement and, thus, is difficult to simulate. 

In Ref. [12j , timescales necessary for breaking the area 
law by time evolutions with local Hamiltonians have been 
studied. Combined with our result, we can explicitly es- 
timate the timescale necessary for satisfying phase er- 
godicity when the Hamiltonian is composed of separable 
eigenstates. 



V. PHASE-RANDOM CIRCUIT 

We present a phase-random circuit generating an en- 
semble of states Tp^^^^° that provides the same average 
entanglement as the phase-random ensemble Tp^^^^ = 
Tphase({^a, |^)}a), whcrc |^o) = E ^a^*'^" 1^) is the iuput 
to the circuit and {|a)}^^2 computational basis. 

A phase-random circuit is similar to those considered 
in We consider a circuit composed of T iterations 

of two-qubit unitaries diagonal in the computational ba- 
sis denoted by W^, where the subscript t denotes the 
t-th iteration {t = 1, 2, . . . , T). For each iteration t, the 
two-qubit unitary Wt acts on a pair of qubits i^j{j ^ i) 
randomly chosen uniformly from {1,2,-- - ,A^}, and is 
written 

Wt = czu,,PMPjAlit), (7) 

where CZij = diag(l, 1, 1, — 1) is a controlled- Z op- 
eration on qubits i and j, Pk{0) = diag(l,e*^) de- 
notes a phase gate on the qubit /c, and the two an- 
gles a, P are randomly chosen uniformly from the in- 
terval [0,27r]. A specific instance of the circuit is de- 
scribed by the set {it^jt^<^t^Pt}tLi^ and the correspond- 
ing output state after T iterations of Wt is given by 
|(/)t) = WTWr-i-'-Wilcl^o). where |(/)o) = EraC'^^ \a) 
is an input state in the computational basis defin- 
ing the ensemble T^^^^^. 
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A. Summary of results 

Here we state the main results, with the details of the 
proof to follow. Denote by 'E[Ej^\\(f)T))] the expectation 
value of E^j^\\(j)T)) taken over the uniform distribution 
of {it^jti OLt^ /3t}t=i- We will prove the following two the- 
orems regarding the ability of 'E[E^\\(f)T))] to equal the 
average of phase random states (^^^^hycomp after suffi- 

phase 

ciently many iterations, and about the required number 
of iterations. 

Theorem 1 With the preceding definitions and nota- 
tions, 

lim_E[4'^)(|^T))] = {4^^)T-f> (8) 



with 



a, 6 leA ieA 

+ E^a, (9) 

a 

where aia2 - ■ - aN (^1^2 • • • ^at) ^ {0, 1}^ is a binary rep- 
resentation of a — 1 (6 — 1). 

Theorem 2 Let Tmix(e) be the number of iterations re- 
quired to achieve Eq. (|8j) with error e, namely. 



VT > Tmix(e), 



E[4''^(i0T))]-(^roT-- 



< e. 



For r C {1, • • • ,7V}, define i^^^\\(I)q)) such that 



a/6 



Then, if maxr a^(^)(|0o)) = 0(2-^), Tmix(e) is polyno- 
mial in the system size for any A. In particular, for 
~ Pa (TV) 2 where {pa{N)}a are polynomial func- 
tions of Tmix(e) is poly{N). 

These results are especially interesting if we consider 
an ensemble Tp^^^^° simulating the average amount of 
entanglement of Tp^'^^^. Since the average entanglement 
of Tp^'^g^ violates the area law, most states in Tp^®g^^° 

do also. Hence, Tp^^^^° are not simulatable by MPSs 
although they are generated by a quantum circuit with 
a polynomial number of elementary gates. 

Here, we have focused on the generation of the aver- 
age amount of entanglement of phase-random states. In 
a separate paper [15], it is shown that phase-random cir- 
cuits can approximately generate an ensemble simulating 
the states themselves. 

In the following, we prove Theorems [1] and [2] by adapt- 
ing the method developed in [i3|,[i3 the phase-random 



case. In this method, the key technique is to map the 
evolution of the states in the phase-random circuit to a 
Markov chain, and so we first briefiy review Markov pro- 
cesses in Subsection IV B[ In Subsection IV C\ we present 
the map to a Markov chain, and then investigate its sta- 
tionary distribution. As, contrary to [i3|,[l3, the Markov 
chain is not irreducible in our case, we first decompose 
it into irreducible Markov chains in Subsection IV Dl In 
order to calculate the average amount of entanglement, it 
is sufficient to consider reduced Markov chains, which are 
presented in Subsection lVEl By investigating the station- 
ary distribution of the reduced Markov chain, we finally 
obtain the average amount of entanglement after T steps 
in Subsection IV Fl The mixing time Tmix(e) for achieving 
Eq. (jS]) is treated in Subsection IV Gl 



B. Introduction of a Markov chain 

A Markov chain is a sequence of random variables that 
take values in a set of states S = {s}, indexed in our case 
by discrete steps t. The Markov property is that the 
probability of 5t+i occurring depends only on St, and is 
independent of previous states. We can define at any step 
t a probability distribution H^ over the states space S. 
The Markov property then ensures that subsequent dis- 
tributions are related only to the previous distribution, 
and that this dependence can be given in the form of 
a step-independent, stochastic transition matrix with 
matrix elements denoted by V{s^ s'). Thus, the probabil- 
ity distribution at step t is given by Ht = T^^Hq, where 
Ho is an initial distribution. 

When a Markov chain is irreducible and aperiodic^ the 
probability distribution on each state converges after suf- 
ficiently many steps. That is, for all 5, there exists a 
unique Hoo(5) = hm^^oo ^t{s) that is independent of the 
initial probability distribution. Irreducibility is a prop- 
erty of the transition matrix implying that any state s 
can transition to any other state in a finite number of 
steps, that is, for all s and s\ there exists a t such that 
V^{s^s') > 0. Aperiodicity implies that, for all states 
s, there exists a non-zero probability to remain in that 
state, namely, V{s^ s) > for all s. A sufficient condition 
for a distribution to be stationary is given by the detailed 
balance condition 

U{s)V{s, = U{s')V{s\ s), for ah 5, G S. 

When a Markov chain satisfies the detailed balance equa- 
tions, it is referred to as reversible. 

Next, we define the mixing time^ which is the number 
of Markov chain steps required for the distance between 
the actual distribution and the stationary distribution 
to be small, where we define the distance between two 
probability distributions as follows. Let S{so) be an ini- 
tial probability distribution of a Markov chain with value 
1 at So and zero elsewhere on the state space S. Let us 
denote the sum of the probabilities of a distribution over 
a subset of states by H(6'') = Xls^5/H(s), and by 



6 



11^(5'' I (5(^0 )) such a sum at step t of a Markov chain that 
initiahzed with the distribution S{so). The variation dis- 
tance after t-steps is defined by 

A,„(t) :=max|n*(5V(5o))-n«3(^')l- 

The mixing time Tmix(e) is then defined for any e > by 

Tmix(e) := min{t| m8ixAs^{t') < e for ah t' > t}. 

soes 

This is the number of steps it would take to get e-close 
to the stationary distribution in the worst case. In prac- 
tice, we do not actually use this definition of the mixing 
time, but rather the following Theorem [3] and Corollary [1] 
regarding the transition matrix. 

For a transition matrix 7^ of a reversible Markov chain, 
let us label the eigenvalues of V in decreasing order such 
that 

1 = Ai > A2 > • • • . 

Then, 7^ := 1 — A2 is called its absolute spectral gap. The 
absolute spectral gap r] gives an upper bound on the mix- 
ing time as stated in the following theorem. 

Theorem 3 (Theorem 12.3 in (III) Let V be the 

transition matrix of a reversible Markov chain on and 
let n(min) := min5^5'n(s). Then 

T^ix(6) <log( / . )-. 

en(mm) r] 

Moreover, a lower bound on the absolute spectral gap r] 
is obtained by the canonical path method. Viewing a re- 
versible transition matrix 7^ as a graph with vertex set 
define the edge set E = {{s^ s')\V{s^ s') > 0}. A canon- 
ical path from s to is a sequence Ess' = (^i, • ' ' ? ^m) 
of edges in E such that ei = (5,51), 62 = (si,52), 
Cm = {sm-i^ s') for vertices s^, i = 1, 2, • • • , m. We have 
the following Corollary [H 

Corollary 1 (Corollary 4 in (l7j |) For a given transi- 
tion matrix let Q{s^s') := 1100(5)7^(5,5') and 

''~fiiW) ^ noo(s)noo(s')- (10) 

Then 



By combining Theorem [3] and Corollary (TJ an upper 
bound on the mixing time can be obtained. 



C. Map to a Markov chain 

We will now show that the change in the state |(/>t) ^ 
\(j)t+i) upon the application of the two-qubit unitary 
Wt+i defined by Eq. ([7]) can be formulated in terms of a 
transition matrix action on the indices of expansion coef- 
ficients of the state in the basis of local Pauli operators. 
The hermiticity of this basis ensures that the coefficients 
are real, and hence their square gives a valid probability 
distribution, while its locality ensures that we can focus 
on the qubits i and j where Wt+i acts, eventually sim- 
plifying the calculation of the linear entropy. 

Let us consider the expansion of given by 

,qN 

where qi G {0,x,?/,z} and aq. are Pauh operators. We 
denote (<7i,--- ^Qn) by the vector q. We construct a 
Markov chain defined on {q} in which the probability 
distribution is given by the expectation value of ^|(q) 
over at and which is denoted by E[(f^(q)]. For this 
purpose, we first examine E[^|(q)], and then construct 
the Markov chain. For simplicity, hereafter we omit the 
step indices on qubits and write (i^j). 

By applying Wt+i on a randomly chosen pair of qubits 
(i, j), the coefficients {^t+i(q)} of the state | 
become 

Tr[crp^ ^ (Jp.Wt^iaq. cr^,- 

where pp.^q.^p.^q. is p but with components (pi^pj) re- 
placed by (^i, Qj). Squaring this to arrive at a probability 
distribution, we have 

qi,qj, 

X 6(Pp,^(?^p,-^gpGt+i(p,q, q ), 

(11) 

where 

<^t+i(p,q,q ) — 

Tr[crp^ (g) ap^Wt^iaq. (g) cTq^W^^^] 

X TT[ap^ cTp.Wt+iaq^^ ^ ^g^^Z+i]- 

In order to see that Gt+i defines a transition matrix, it 
is important to recognize that it treats the sets of Pauli 
indices {0, 2;} and {x^y} equivalent ly. We write wqz and 
Wxy for arbitrary elements of each set respectively, and we 
define an involution -1 as -lO = z and = y. Averaging 
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Pi 





X y 


z 







At 


Ct{0) Ct{0) 


At 




X 


Bt{0) 


Dt Dt 


Bi[z) 




y 


Bt{0) 


Dt Dt 


Bt{z) 




z 


At 


Ct{z) Ct{z) 


At 



TABLE I: Table of E[^?+i(p)| \(j)t)] as a function of p^ and pj. 
over at^ Pt^ we obtain 



E[Gt+i(p,q,qO] = ^^qq' X 



where each case is defined by 



16 case I 

8 case II 

4 case III 

otherwise, 



(12) 



case {Pi = qi= wqz) A {pj qj wqz) 
case II <^ {^pi =qi = wqz) A {pj.qj = w^y) 

V {Pi.qi Wxy) A {^pj = qj = wqz) 
case III ^ pi,qi,pj,qj = Wj^y . 

By substituting E[Gt+i(p, q, q')] into Eq. (fTT|) . we ob- 
tain E[^|^^(p)| |0t)], the expectation value conditional on 
state l^t), with values as shown in Table |T] where 



At = Up), 



w'—x^y 



w'—x^y 



w—x,y w'=x,y 

We are now prepared to define a Markov chain: 







Probability 






1 






1/2 




{^qi,y) 


1/2 


(Wxy^WOz) 




1/2 






1/2 


{^U)xy 5 Wxy^ 




1/4 






1/4 






1/4 




{y,y) 


1/4 



TABLE II: Transition probabilities, 
given by ^g(p), 

ni(p) = ^np,r)no(r) 

r 

= ^P(p,r)^o2(r) 

r 

= E[C?(p)||^o>], 

where the last equation is obtained using Table |T] with 
the definition of the Markov chain M.. By induction on 
Proposition 1 is proven. For example 

n2(p)=5]P(P,r)ni(r) 

r 

= ^P(p,r)E[C?(r)||,^o)] 

r 

= E[^P(p,r)^2(^)||^^^] 

r 

= E[e|(p)||0o)]. 

We recall that a probability distribution 11 can be 
viewed as a vector in a 4^-dimensional space, which we'll 
call Vs^ where 5* = {0, x, z}^. For a given the set 
of all possible lit comprise the probability simplex in Vs 
defined by Xlpn^(p) = 1. The transition rules given 
in Tables IIIIII define a transition matrix V on Vs with 
matrix elements written as 7^(q, p). 



Definition 1 (Markov chain M) Let be a 

Markov chain on a set 6* = {0,x,?/,z}^ = {q}. The 
transition process is described as follows. In each step, 
i and j are randomly chosen from {1, • • • , A^} and the 
transition from q G 5* to p G 5* occurs probabilistically 
according to Table [III The transition probability from q 
to p and the probability distribution over p after t steps 
are denoted by 7^(q, p) and nt(p), respectively. The 
initial distribution no(p) is identified with ^o(p)- 

Proposition 1 The probability distribution nt(p) of 
the Markov chain Ai coincides with E[^|(p)| |(/>o)]- 

Since the initial distribution of the Markov chain Ai is 



D. Irreducible decomposition of the Markov chain 

In this subsection, we give the irreducible decomposi- 
tion of By the definition of the Markov chain A4, 
it is obvious that the number of x and in q is invari- 
ant under the action of the transition matrix V. Thus 
we obtain the irreducible decomposition of Vs given by 
Proposition [2] (see also Fig. [3]). 

Proposition 2 (Irreducible decomposition of S) 

For q = qiq2 - ■ • qN ^ S = {0, x, 7/, z}^, let X(q) be the 
sequence {i G [1, • • • , N]\qi G {x, y}} and let S{T) be the 
set defined by 

S{T) := {q|X(q) = T}, 



xyO 

TT 



Hi 

xxz 



BJl 



yxO 



i—Q. 



yxz 

T 



\ 



oooIj 



DJL 



zOO 



/T 



xyz 



\yyz 

TTT- 



0^0 



zzz 



FIG. 3: (Color online) An example of some irreducible sets of 
M when N = 3. Directed lines imply the transition occurs 
with a fixed probability. The probability of blue (dotted), 
red (solid), green (dashed) and purple (dashed-dotted) lines 
is 1/6, 1/12, 1/3 and 1, respectively. Elements such as 000, 
zOO and so on are invariant under the Markov process. 



where F is any subset of {1,2,- •• ,A^}. Then, for the 
Markov chain the irreducible decomposition of Vs is 
given by 



Vs 



^{q} e ^5(r), 



qG5(0) r/0 

where Vs' is the vector space defined by the subset S' . 

Since V|q} is always one dimensional by definition, we 
have that Uti^) = no(q) for all q G 6'(0) and for all t. 
Thus nt(q G S'(0)) is given by 

n,(q)=no(q)=eo(q)' 

= 2-^ ((/>okq|0o)' 
AT 

a,b i=l 

(13) 

where we have used the fact that, for q G 5'(0), qi G {0, z} 
for all i and <jq := CTg^ • • • (8) cr^^ . 



E. Reduction of the Markov chain 



In order to describe the evolution of E^j^\\(j)t)) ^ a full 
investigation of the Markov chain A4 is not necessary due 
to the definition of the linear entropy Sl{p) = 1 — Trp^. 
This can be seen by considering the reduced density ma- 
trix of 1 0t) on a subsystem A. 

4'^=Tr^l^t)(0t| 



and Tr((^^^)^ is given by 



q s.t. 

qi^O,ieA 



?(r)(7,7) P(')(7 + 1,7 + 1) V^'\N,N) 

^^'^7,7 + 1) fl ?(r)(^ + l,^ + 2) P^^^W^-l) 



^7 



7 + 1 



N 



?(^)(7 + l,7) 



p(^)(7 + 2,7 + l) ?(r)(iV-l,iV) 



FIG. 4: (Color onhne) Graph of the Markov chain Mr- Di- 
rected colored lines imply transition occurs with a fixed prob- 
ability. Transition probabilities are given in Proposition 3. 



Hence, its expectation value is 

E[Tr(<^({))2||0o)] = 2^-* Yl mi^fWM] 

q s.t. 

qi=0,ieA 

= 2^- Y n,(q). 

q s.t. 

Thus, it is sufficient to investigate nt(q) for q such that 
= for i G A. The only important property is the 
number of non-zero terms in q. For this reason, let us 
define the set x*^^H^) 

x'^Hq) - {ie[i,--- ,NUj^o,qeS{r)}, 

which indicates the positions of non-zero terms in q G 
S{T). Using this notation, the expectation value is writ- 
ten by 

E[TV(^i*))2||0o)]=2^^^ J2 n*(q). (14) 

rcA q s.t. 

Since nt(q) for q G S{9) is already given by Eq. ([13]), we 
consider only q G 6'(r) for F 7^ 0. 

For this reason, we can reduce the Markov chain A4 
to a simpler Markov chain Air- For a given number of 
X OT y entries 7 := |F| in q, the number of non-zero el- 
ements Ix*^^^ (q)| can take values {7, 7 + 1, • • • , N}. The 
new Markov chain is a drunkard's walk on this set (see 
Fig,|4|), with transition probabilities given by the follow- 
ing proposition. 

Proposition 3 For the Markov chain Air defined on 
{i G {7, •••A^}}, a transition from i to j occurs with 
probability. 



V^^\i,j = i-1) 



27(i - 7) 



N{N -1)' 

' iV(iV-l) 

This is directly induced from the definitions of the 
Markov chain M. and Ai. For instance, the transition 
i ^ i -\- 1 in Air occurs if and only if {qi^qj) = {wxy^O) 
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or {qi^Qj) = {O^Wxy) in A^. As the number of zeroes is 
N — i and the number of w^y = 7, its probabihty is given 

Ar(Ar-i)/2- 

As stated in the introductory subsection, since the 
Markov chain A^r is irreducible and aperiodic, it has a 
unique stationary distribution 11^ , which is determined 
by the detailed balance condition and the normalization. 
The detailed balance condition gives the equation 

i^'P^^^ (i, i + 1) = n(^) {i + i)p(r) (i + 1, i). 

Using this equation, we obtain 

n£)«=(^_-^^)n(^)(7). (15) 

The normalization in S{T) depends on the input state 
|0o) as 

N 

EnSH^)= E no(q)= E ^o(q). m 

i=7 qG5(r) qG5(r) 

On the other hand, Eq. ([15]) gives 

N 

^n(^)(z) = 2^-'^n(^)(7). 

Hence, the stationary distribution is given by 



2N-'y U - 7 



qe5(r) 



Recalhng that ^o(q) — l</^o), it is not difficult to 

compute EqG5(r) ^o(q). which gives 

E ^o(q)=E-^'n(i-^-^^)n^-^^- 

Thus the stationary distribution for a given subset F is 
V ^ ^ J a^b ier i^r 

(17) 



F. Calculation of liniT^oo E[5'r^(|(/)T))] 

We will now calculate the large time limit of 
the expectation value of the amount of entanglement 
limT^oo using the results of the previous 

two subsections. From Eq. (fT4|) we have 

E[IV(0jVll0o)] 



2^^^ ^ n^(q) 



rcA q s.t. 



E + E E 

'- q s.t. ADVyit/) q s.t. ■ 

X<'"(q)CA X<^'(q)CA 



nT(q). (18) 



The first term in Eq. ([T8|) is calculated from Eq. (p!3|) 

as 

E nr(q) 

q s.t. 

x<'"(q)c/i 



q s.t. a, 6 i=l 

X<'"(q)CA 

=2-^-E-^'n'^-''^' 

a, 6 ieA 

where the last expression is derived from the relation 

N 



q s.t. i=l 



ieA 

The second term in Eq. ([18]) is obtained from the sta- 
tionary distributions n^'^(z) given by Eq. (pT|) . From the 
definition of the Markov chain A^, for q, G aS'(F), if the 
number of z in q is equal to that in q', noo(q) = Iloo{<l^)^ 
so that 



E E noo(q) 



q s.t. 

X^'^^(q)CA 

E E (N--f\ ^ioH'^) 

= E E''>^'E^^(''.-7)n^-^.n(i-^-''.) 

= 2-^-E-argn^-''^' 
where we have used the relation 

Combining the two we arrive at the final expression 
lim_E[Tr(<^J))2||0o)] 



T^OO 



=E''arg(n^-^.+n^-''.)-E-a, 



a,b ieA 



ieA 



and since £;[^^(|(/)t)) = l-Tr(^J^)^ Eq. (O is obtained. 



G. Mixing time 

In this final subsection we bound the mixing time of 
the Markov chain Air using Theorem [3] and Corollary [TJ 
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From Eq. ([T7j), the stationary distribution in 5'(r) is 
given by 



z — 7 



where we have introduced the notation i<i'^^\\(j)o)) := 

Ea^b^l^bUieri^ - ^a.bJUi^r^a^b.' Thus n£^(min) is 
given by 



n^^(min) = min 



e{7,---,N} I i - 7 

Let p^^^ be the expression defined by Eq. (p!Q]) for the 
Markov chain Mr. An upper bound of p*^^^ for TWr is 
then given by 



,(r) < 



Since the graph of the Markov chain A^r is hnear, as 
shown in Fig. IH an upper bound on the maximum of 
iJ n£^(i)n£^(j) is given by 



Sin3e 



Y: n(S)(z)n(^)o-) 



Sii3e 



< 



2A/'— 7 



N 



2^-^ / ie 



max y (^-7) y (^-7) 



x=0 



y=0 



On the other hand, the miue^^; Q^^H^) factor can be 
computed as follows. In the Markov chain A^r, edges are 
of the form (i, i + 1) or (i, i — 1). By the symmetry of the 
linear graph, one sees that Q^^)(Ar+7-i,7V+7-z-l) = 
(5^^^(z, i + 1), and it is sufficient to consider the minimum 
of Q'^^^i^i -\- 1)^ which is given by 



Thus p^^^ is bounded from above as 



,(r) < 2"-^iV(iV-l) (r)( 
' - 27(iV-7) ^ 



*o)). 



In order to achieve 
VT > T„i,(e) 



E[4^)(i0^))]-(4^))~ 



< e 



for all A, it is sufficient for each Markov chain A^r to 
converge with error := since the linear entropy is 

the sum of the stationary distributions in A^r as shown 
by Eq. (p!8]) . Therefore, from Theorem [3] and Corollary 
[U we obtain an upper bound on Tmix(e) given by 



Tmix(e) < max 



N{N -1) 

X 



7V-7-log(^-/.(^)(|(/>o))) 



This is dominated by the factor [2^-^ f^^^W(j)o))]^ . Thus, 
maxr A>:*^^^(|0o)) = 0{2~^) is sufficient for Tmix(e) to be 
a polynomial in TV. 

This concludes the proof. 



VI. SUMMARY 

We have deffned phase-random states as an ensemble 
of states with fixed amplitudes and with uniformly dis- 
tributed phases in a fixed basis. We have discussed their 
use for the realization of canonical distributions in sta- 
tistical mechanics. We then derived a general formula 
for the average amount of entanglement of phase-random 
states. Applying these results, we have argued for the 
simulatability of time evolving states by a Hamiltonian 
dynamics, and have shown the difficulty of their simu- 
lation for semi-classical Hamiltonian systems by MPSs. 
Finally, we have proven that an ensemble of states that 
provides the same average entanglement of phase-random 
states can be generated efficiently by a phase-random cir- 
cuit composed of relatively simple gates. 

We acknowledge V. Vedral for useful comments. This 
work is supported by Project for Developing Innova- 
tion Systems of MEXT, Japan and JSPS by KAKENHI 
(Grant No. 222812, No. 23540463 and 23240001). 



Appendix A: Concentration of measure 

In this appendix, we show that for the ensemble of 
phase-random states with equal-amplitudes in a separa- 
ble basis, Tp^'^g^, the amount of entanglement is highly 
concentrated around the average. Formally, by defin- 

ing a4^)(|(/>-^'-p)) := |4^^(|(/>^^'-P)) - (4^^)t--| 

where |^^^'^^p) = e''^" K^), we prove that 

Prob[A£;[^^(|(^^^'^^P)) > 2/2^ + e] < exp[-ce'^2^] where 

C=1/{2^^7T^). 

First, for two states in the ensemble Tp^'^^^ de- 
noted by |(^) = 2-^/2 e^^« and = 
2-^/2 ^^e^^^ KP), let us define the distance 00 
between them in the parameter space [0, 27r)^ by 



1 y^ l'/'g-CI 
27r ^ 2^ 
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Then, using the theorems in Appendix C of Ref. [l[, we 
obtain the upper bound of the concentration function 
(^d{r) by 



a,(r)<exp[--2^], 



(Al) 



where the concentration function ad{r) imphes that, 
for any subset A G [0,27r)^ with measure 1/2, its r- 
neighborhood with resp ect to the metric d has mea- 
sure at least 1 — ad{r) |18j . 

Now, we evaluate the amount of the change in the pa- 
rameter space necessary to change A^^^^* more 
than e, which is obtained from the following proposition; 



Proposition 4 For \^) and \^') G T;^;^^"^, 

|a4^)(|</>)) - a4^\|(/>0)| < 4V^Vd(^). (A2) 

Proof 1 Using the notation (j)A = Tr^ \ we calcu- 

late 

|a4'^)(|</.))-A4'^)(|</.'))| 

<i4^)(i^»-4'^)(i^'»i 

= |TV<^^-Tr^'^| 

= \Tr{4>A-<P'A){4>A + '^'A)\ 

< II '^A-'^' A II 2 II + A II 2 (A3) 
<2Dhs{4>aJ'a) 

<2Dhs{\4>){^\,\4>'){<P'\) (A4) 

< 2110) -10') I, 



where ||74||2 := VTtAA^ is the Hilbert- Schmidt norm and 
Dhs{AB) = \\A- B\\2. Inequalities (|A3)) and ([Al]) are 
obtained using Cauchy- Schwartz and Kadison 's inequal- 
ities respectively. Since | |0) — |(/>') | < \/4:7rd{(j), (j)') 
dl, we obtain Eq. 



Hence, in order to change /S.E^^\\(jf'^'^^'^)) more than 

e, d{(j)^(j)^) must be changed more than Combining 
this with the concentration of measure given by ()Aip , we 
obtain 

Prob[A4^^(|(/)^^'^^P)) > /iM(A4^^(|(/)^^'^^P))) + e] 

<exp[-ce^2^], (A5) 

where /iM represents the median and c = l/(2^^7r^). By 
using Markov's inequality and the convexity of ^/x, the 
median is bounded from above such that 

/xm(a4'^)(|0<''1'-p))) < 2(A4'^))x;..sep 



<2V{(A4^))2)^ec,..p 



-Y<eq,sep 
phase 



where cr^^cA) is the standard deviation of Ei'^^ Since 
the standard deviation cr^(A) for Tp^'^^^ can be directly 
calculated and is upper bounded by 2~^, we obtain 



Prob 



< exp[— ce 2 
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